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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Lai, Preeti 

6 Bandman, Olga 

8 (ii) TITLE OP INVENTION: NOVEL HUMAN SODIUM- DEPENDENT 

9 PHOSPHATE CO -TRANSPORTER 
11 (iii) NUMBER OF SEQUENCES: 7 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc 

15 (B) STREET: 3174 Porter Drive 

16 (C) CITY: Palo Alto 

17 (D) STATE: CA 

18 (E) COUNTRY: US 

19 (F) ZIP : 94304 

21 (v) COMPUTER READABLE FORM: 

22 (A) MEDIUM TYPE: Diskette 
2 3 (B) COMPUTER: IBM Compatible 

24 (C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ Version 2.0 
27 (vi) CURRENT APPLICATION DATA: 

C--> 28 (A) APPLICATION NUMBER: US/09/965 f 522 

C--> 29 (B) FILING DATE: 26-Sep-2001 

30 (C) CLASSIFICATION: 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/391,958 

34 (B) FILING DATE: 1999-09-08 

37 (viii) ATTORNEY/AGENT INFORMATION: 

38 (A) NAME: Billings, Lucy J. 

39 (B) REGISTRATION NUMBER: 36,749 

40 (C) REFERENCE/DOCKET NUMBER: PF-0221 US 

42 (ix) TELECOMMUNICATION INFORMATION: 

43 (A) TELEPHONE: 415-855-0555 

44 (B) TELEFAX: 415-845-4166 
47 (2) INFORMATION FOR SEQ ID NO : 1 : 

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 401 amino acids 

51 (B). TYPE: amino acid *• 

52 (C) STRANDEDNESS: single 

53 (D) TOPOLOGY: linear 

55 (vii) IMMEDIATE SOURCE: 

56 (A) LIBRARY: BRAITUT02 

57 (B) CLONE: 754412 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

61 Met Gin Val Asp Glu Thr Leu lie Pro Arg Lys Val Pro Ser Leu Cys 

62 1 5 10 15 
6 3 Ser Ala Arg Tyr Gly lie Ala Leu Val Leu His Phe Cys Asn Phe Thr 
64 20 25 30 
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65 Thr He Ala Gin Asn Val He Met Asn He Thr Met Val Ala Met Val 

66 35 40 45 

67 Asn Ser Thr Ser Pro Gin Ser Gin Leu Asn Asp Ser Ser Glu Val Leu 

68 50 55 60 

69 Pro Val Asp Ser Phe Gly Gly Leu Ser Lys Ala Pro Lys Ser Leu Pro 

70 65 70 75 80 

71 Ala Lys Ser Ser He Leu Gly Gly Gin Phe Ala He Trp Glu Arg Trp 

72 85 90 95 

73 Gly Pro Pro Gin Glu Arg Ser Arg Leu Cys Ser He Ala Leu Ser Gly 

74 100 105 110 

75 Met Leu Leu Gly Cys Phe Thr Ala He Leu He Gly Gly Phe He Ser 

76 115 120 125 

77 Glu Thr Leu Gly Trp Pro Phe Val Phe Tyr He Phe Gly Gly Val Gly 

78 130 135 140 

79 Cys Val Cys Cys Leu Leu Trp Phe Val Val He Tyr Asp Asp Pro Val 

80 145 150 155 160 

81 Ser Tyr Pro Trp He Ser Thr Ser Glu Lys Glu Tyr He He Ser Ser 

82 165 170 175 

83 Leu Lys Gin Gin Val Gly Ser Ser Lys Gin Pro Leu Pro He Lys Ala 

84 180 185 190 

85 Met Leu Arg Ser Leu Pro He Trp Ser He Cys Leu Gly Cys Phe Ser 

86 195 200 205 

87 His Gin Trp Leu Val Ser Thr Met Val Val Tyr He Pro Thr Tyr He , 

88 210 215 220 

89 Ser Ser Val Tyr His Val Asn He Arg Asp Asn Gly Leu Leu Ser Ala 

90 225 230 235 240 

91 Leu Pro Phe He Val Ala Trp Val He Gly Met Val Gly Gly Tyr Leu 

92 245 250 255 

93 Ala Asp Phe Leu Leu Thr Lys Lys Phe Arg Leu He Thr Val Arg Lys 

94 260 265 270 

95 He Ala Thr He Leu Gly Ser Leu Pro Ser Ser Ala Leu He Val Ser 

96 275 280 285 

97 Leu Pro Tyr Leu Asn Ser Gly Tyr He Thr Ala Thr Ala Leu Leu Thr 

98 290 295 300 

99 Leu Ser Cys Gly Leu Ser Thr Leu Cys Gin Ser Gly He Tyr He Asn 

100 305 310 315 320 

101 Val Leu Asp He Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser 

102 325 330 335 

103 Arg Gly Phe Ser Ser He Ala Pro Val He Val Pro Thr Val Ser Gly 

104 340 345 350 

105 Phe Leu Leu Ser Gin Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe 

106 355 360 365 

107 Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu He Phe 

108 370 375 380 

109 Gly Glu Ala Asp Val Gin Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg 

110 385 390 395 400 

111 Leu 

114 (2) INFORMATION FOR SEQ ID NO : 2: 
116 (i) SEQUENCE CHARACTERISTICS: 
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117 
118 
119 
120 
122 
123 
124 
126 



(vii) 



(xi) 



(A) LENGTH: 1643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 

SEQUENCE DESCRIPTION: SEQ ID NO : 2: 



128 AGAACGGTGA GGATGACCGA CGTATAGGCG AGAGCCTAGG TACGCCATGC CAGGTCACCG 

129 GTCCGGCAAT TCCCGGGTCG ACCCACGCGT CCGCTTGGAG GGACGCTGGG TTCAACTTGA 

130 AGCCCTTCCA CAGACATTAA GTCGGTGAAA ACCATTCACT AGGAGAGGAG AAACACAATG 

131 GCCACCAAGA CAGAGTTGAG TCCCACAGCA AGGGAGAGCA AGAACGCACA AGATATGCAA 

132 GTGGATGAGA CACTGATCCC CAGGAAAGTT CCAAGTTTAT GTTCTGCTCG CTATGGAATA 

133 GCCCTCGTCT TACATTTCTG CAATTTCACA ACGATAGCAC AAAATGTCAT CATGAACATC 

134 ACCATGGTAG CCATGGTCAA CAGCACAAGC CCTCAATCCC AGCTCAATGA TTCCTCTGAG 

135 GTGCTGCCTG TTGACTCATT TGGTGGCCTA AGTAAAGCCC CAAAGAGTCT TCCTGCAAAG 

136 TCCTCAATAC TTGGGGGTCA GTTTGCAATT TGGGAAAGGT GGGGCCCTCC ACAAGAACGA 

137 AGCAGACTCT GCAGCATTGC TTTATCAGGA ATGTTACTGG GATGCTTTAC TGCCATCCTC 

138 ATAGGTGGCT TCATTAGTGA AACCCTTGGG TGGCCCTTTG TCTTCTATAT CTTTGGAGGT 

139 GTTGGCTGTG TCTGCTGCCT TCTCTGGTTT GTTGTGATTT ATGATGACCC CGTTTCCTAT 
14 0 CCATGGATAA GCACCTCAGA AAAAGAATAC ATCATATCCT CCTTGAAACA ACAGGTCGGG 

141 TCTTCTAAGC AGCCTCTTCC CATCAAAGCT ATGCTCAGAT CTCTACCCAT TTGGTCCATA 

142 TGTTTAGGCT GTTTCAGCCA TCAATGGTTA GTTAGCACAA TGGTTGTATA CATACCAACT 
14 3 TACATCAGCT CTGTGTACCA TGTTAACATC AGAGACAATG GACTTCTATC TGCCCTTCCT 
144 TTTATTGTTG CCTGGGTCAT AGGCATGGTG GGAGGCTATC TGGCAGATTT CCTTCTAACC 
14 5 AAAAAGTTTA GACTCATCAC TGTGAGGAAA ATTGCCACAA TTTTAGGAAG TCTCCCCTCT 
14 6 TCAGCACTCA TTGTGTCTCT GCCTTACCTC AATTCCGGCT ATATCACAGC AACTGCCTTG 
14 7 CTGACGCTCT CTTGCGGATT AAGCACATTG TGTCAGTCAG GGATTTATAT CAATGTCTTA 
148 GATATTGCTC CAAGGTATTC CAGTTTTCTC ATGGGAGCAT CAAGAGGATT TTCGAGCATA 
14 9 GCACCTGTCA TTGTACCCAC TGTCAGCGGA TTTCTTCTTA GTCAGGACCC TGAGTTTGGG 

150 TGGAGGAATG TCTTCTTCTT GCTGTTTGCC GTTAACCTGT TAGGACTACT CTTCTACCTC 

151 ATATTTGGAG AAGCAGATGT CCAAGAATGG GCTAAAGAGA GAAAACTCAC TCGTTTATGA 

152 AGTTATCCCA CCTTGGATGG AAAAGTCATT AGGCACCGTA TTGCATAAAA TAGAAGGCTT 

153 CCGTGATGAA AATACCAGTG AAAAGATTTT TTTTTCCTGT GGCTCTTTTC AATTATGAGA 

154 TCAGTTCATT ATTTTATTCA GACTTTTTTT TGAGAGAAAT GTAAGATGAA TAAAAATTCA 

155 AATAAAATGA TAACTAAGAA TGC 

157 (2) INFORMATION FOR SEQ ID NO : 3: 

159 (i) SEQUENCE CHARACTERISTICS: 

160 (A) LENGTH: 467 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: single 

163 (D) TOPOLOGY: linear 

165 (vii) IMMEDIATE SOURCE: 

166 (A) LIBRARY: GenBank 

167 (B) CLONE: 450532 

169 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

171 Met Gin Met Asp Asn Arg Leu Pro Pro Lys Lys Val Pro Gly Phe Cys 

172 15 10 15 

173 Ser Phe Arg Tyr Gly Leu Ser Phe Leu Val His Cys Cys Asn Val lie 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1643 
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174 



20 



25 



30 



175 He Thr Ala Gin Arg Ala Cys Leu Asn Leu Thr Met Val Val Met Val 

176 35 40 45 

177 Asn Ser Thr Asp Pro His Gly Leu Pro Asn Thr Ser Thr Lys Lys Leu 

178 50 55 60 

179 Leu Asp Asn He Lys Asn Pro Met Tyr Asn Trp Ser Pro Asp lie Gin 

180 65 70 75 80 

181 Gly He He Leu Ser Ser Thr Ser Tyr Gly Val He He He Gin Val 

182 85 90 95 

183 Pro Val Gly Tyr Phe Ser Gly He Tyr Ser Thr Lys Lys Met He Gly 

184 100 105 110 

185 Phe Ala Leu Cys Leu Ser Ser Val Leu Ser Leu Leu He Pro Pro Ala 

186 115 120 125 

187 Ala Gly He Gly Val Ala Trp Val Val Val Cys Arg Ala Val Gin Gly 

188 130 135 140 

189 Ala Ala Gin Gly He Val Ala Thr Ala Gin Phe Glu He Tyr Val Lys 

190 145 150 155 160 

191 Trp Ala Pro. Pro Leu Glu Arg Gly Arg Leu Thr Ser Met Ser Thr Ser 

192 165 170 175 

193 Gly Phe Leu Leu Gly Pro Phe He Val Leu Leu Val Thr Gly Val He 

194 180 185 190 

195 Cys Glu Ser Leu Gly Trp Pro Met Val Phe Tyr He Phe Gly Ala Cys 

196 195 200 205 

197 Gly Cys Ala Val Cys Leu Leu Trp Phe Val Leu Phe Tyr Asp Asp Pro 

198 210 215 220 

199 Lys Asp His Pro Cys He Ser He Ser Glu Lys Glu Tyr He Thr Ser 

200 225 230 235 240 

201 Ser Leu Val Gin Gin Val Ser Ser Ser Arg Gin Ser Leu Pro He Lys 

202 245 250 255 

203 Ala He Leu Lys Ser Leu Pro Val Trp Ala He Ser He Gly Ser Phe 

204 260 265 270 

205 Thr Phe Phe Trp Ser His Asn He Met Thr Leu Tyr Thr Pro Met Phe 

206 275 280 285 

207 He Asn Ser Met Leu His Val Asn He Lys Glu Asn Gly Phe Leu Ser 

208 290 295 300 

209 Ser Leu Pro Tyr Leu Phe Ala Trp He Cys Gly Asn Leu Ala Gly Gin 

210 305 310 315 320 

211 Leu Ser Asp Phe Phe Leu Thr Arg Asn He Leu Ser Val He Ala Val 

212 325 330 335 

213 Arg Lys Leu Phe Thr Ala Ala Gly Phe Leu Leu Pro Ala He Phe Gly 

214 340 345 350 

215 Val Cys Leu Pro Tyr Leu Ser Ser Thr Phe Tyr Ser He Val He Phe 

216 355 360 365 

217 Leu He Leu Ala Gly Ala Thr Gly Ser Phe Cys Leu Gly Gly Val Phe 

218 370 375 380 

219 He Asn Gly Leu Asp lie Ala Pro Arg Tyr Phe Gly Phe lie Lys Ala 

220 385 390 395 400 

221 Cys Ser Thr Leu Thr Gly Met He Gly Gly Leu He Ala Ser Thr Leu 

222 405 410 415 
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Val 
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Glu 


He 


Gin 
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Glu 
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His 


228 




450 
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460 










229 
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Arg 
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465 
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(2) : 


INFORMATION FOR SEQ ID NO : 4: 
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(i) 


SEQUENCE CHARACTERISTICS: 
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(A) 


LENGTH 


: 56C 


) amino acids 
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(B) 


TYPE: amino acid 
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(C) 


STRANDEDNESS : single 
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(D) 


TOPOLOGY: linear 


















94. f) 

z. *± u 


(vii) 


IMMEDIATE SOURCE: 
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(A) 


LIBRARY: GenBank 


















949 






(B) 


CLONE : 


507415 
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SEQUENCE DESCRIPTION: SEQ ID NO: 


: 4 : 












9 4 


Met 


Glu 


Php 




OX 11 




V7X U 


true 


Arg 


Lys 


Leu 


Ala 


Gly 


Arg 


Ala 


Leu 


94 7 


1 








5 










10 










15 




94 fl 


Glv 


Arg 


Leu 


His 


Arg 


Leu 


Leu 


Glu 


Lys 


Arg 


Gin 


Glu 


Gly 


Ala 


Glu 


Thr 


94^ 








20 










25 










30 






9 


Leu 


Glu 


Leu 


Ser 


Ala 


Asp 


Gly 


Arg 


Pro 


Val 


X 11X 


X I1X 


His 


Thr 


Arg 


Asp 


9 SI 

j£ J -L 






35 










40 










45 








9 S9 


Pro 


Pro 


Val 


Val 


Asp 


Cys 


Thr 


Cys 


Phe 


Gly 


T.^i 1 
XjC \JL 


Pro 


A f-rr 
rtX y 


rtx y 


xyx 


He 


9 ST 




50 










55 










U \J 










9 S4 


He 


Ala 


He 


Met 


Ser 


Gly 


Leu 


Gly 


Phe 


Cys 


11C 


C o T" 
OCX 




VjX 


He 


Arg 


9 S S 

z -j -j 


65 










70 










75 










80 


9 Sfi 

Z J D 


Cys 


Asn 


Leu 


Gly 


val 


Ala 


He 


Val 


Ser 


Met 


Val 


As n 


Asn 


Ser 


Thr 


Thr 


257 










85 










90 










95 




9 Sfl 

Z> <J 


His 


Arg 


Gly 


Gly 


His 


Val 


Val 


Val 


Gin 


Lys 


Ala 


Gin 


Phe 


Asn 


j.x y 


Asp 


259 








100 










105 










110 






260 


Pro 


Glu 


Thr 


Val 


Gly 


Leu 


He 


His 


Gly 


Ser 


Phe 


Phe 


±J -br 


Glv 


Tvr 

X Jf X 


He 


261 






115 










120 










125 








9fi 9 

z, D Zt 


Val 


Thr 


Gin 


He 


Pro 


Gly 


Gly 


Phe 


He 


Cys 


Gin 


Lys 


Phe 


Ala 


Ala 


Asn 


263 




130 










135 










140 










264 


Arg 


Val 


Phe 


Gly 


Phe 


Ala 


He 


Val 


Ala 


Thr 


Ser 


Thr 


Leu 


Asn 


Met 


Leu 


265 


145 










150 










155 










160 


266 


He 


Pro 


Ser 


Ala 


Ala 


Arg 


Val 


His 


Tyr 


Gly 


Cys 


Val 


He 


Phe 


Val 


Arg 


267 










165 










170 










175 




268 


He 


Leu 


Gin 


Gly 


Leu 


Val 


Glu 


Gly 


Val 


Thr 


Tyr 


Pro 


Ala 


Cys 


His 


Gly 


269 








180 










185 










190 






270 


He 


Trp 


Ser 


Lys 


Trp 


Ala 


Pro 


Pro 


Leu 


Glu 


Arg 


Ser 


Arg 


Leu 


Ala 


Thr 


271 






195 










200 










205 








272 


Thr 


Ala 


Phe 


Cys 


Gly 


Ser 


Tyr 


Ala 


Gly 


Ala 


Val 


Val 


Ala 


Met 


Pro 


Leu 


273 




210 










215 










220 










274 


Ala 


Gly 


Val 


Leu 


Val 


Gin 


Tyr 


Ser 


Gly 


Trp 


Ser 


Ser 


Val 


Phe 


Tyr 


Val 


275 


225 










230 










235 










240 


276 


Tyr 


Gly 


Ser 


Phe 


Gly 


He 


Phe 


Trp 


Tyr 


Leu 


Phe 


Trp 


Leu 


Leu 


Val 


Ser 
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